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a 



b 



c 



d 



e 



f 



1 
11 
16 
22 
29 
34 
41 
44 
51 
56 
63 
64 
69 
76 
81 
85 
92 
99 
105 
108 
115 
122 



Glu Gly Val Arg Ser Gly Asn Asn Leu Thr 
Val Thr Ser Ser Gly 



Gin Asp lie Ser Lys Lys 
Tyr Ala Asp Glu Val Glu Ser 
His Leu Glu Ser lie 
Leu Lys Asp Val Lys Lys Asn 
Leu Lys Lys 

Val Gin His Thr Gin Asn Val 
Gly Leu lie Thr Lys 
Leu Ser Glu lie Lys Lys Lys 

Tyr 

Leu Tyr Asp Leu Lys 
Val Asn Val Leu Ser Glu Ala 
Glu Leu Thr Ser Lys 
Thr Lys Glu Thr 
Lys Glu Lys Leu Thr Ala Thr 
Phe Glu Gin Phe Lys Lys Asp 



Thr Leu Pro Thr Glu Pro 



Glu Lys Lys 
Val Ala Glu Ala Gin Lys Lys 
Val Glu Glu Ala Lys Lys Lys 
Ala Glu Asp Gin 



126 Lys Glu Lys Asp Arg Arg Asn 

133 Tyr Pro Thr lie Thr 

138 Tyr Lys Thr Leu Glu Leu Glu 

145 lie Ala Glu Ser Asp Val Glu 

152 Val Lys Lys Ala Glu Leu Glu 

159 Leu Val Lys Val Lys Ala Lys 

166 Glu Ser Gin Asp Glu Glu Lys 

173 He Lys Gin Ala Glu Ala Glu 

180 Val Glu Ser Lys Gin Ala Glu 

187 Ala Thr Arg 

190 Leu Lys Lys He Lys Thr Asp 

197 Arg Glu Glu Ala Lys Arg Lys 

204 Ala Asp Ala Lys Leu Lys 

210 Glu Ala Val Glu Lys Asn 

216 Val Ala Thr Ser Glu Gin Asp 

223 Lys 

224 Pro Lys Arg Arg Ala Lys Arg Gly Val Ser 
234 Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu 
244 Asn Asp Ala Lys Ser Ser Asp Ser Ser Val 
254 Gly Glu Glu Thr Leu Pro Ser Pro Ser Leu 
264 Asn Met Ala Asn 

268 Glu Ser Gin 

271 Thr Glu His Arg Lys Asp 

277 Val Asp Glu Tyr 

281 He Lys Lys Met Leu Ser Glu 

288 He Gin Leu Asp Arg Arg Lys 

295 His Thr Gin Asn Val 

300 Asn Leu Asn He Lys 

305 Leu Ser Ala He Lys Thr Lys 

312 Tyr Leu Tyr Glu 

316 Leu Ser Val Leu Lys Glu Asn 

323 Ser Lys 

325 Lys Glu Glu Leu Thr Ser Lys 

332 Thr Lys Ala Glu 

336 Leu Thr Ala Ala Phe Glu Gin 




343 
346 
351 
354 
361 
368 
372 
379 
380 



Glu Lys Lys 
Val Ala Glu Ala Glu Lys Lys 
Val Glu Glu Ala Lys Lys Lys 

Ala Lys Asp Gin 
Lys Glu Glu Asp Arg Arg Asn 

Tyr 

Pro Thr Asn Thr 



Phe Lys Lys 



Asp Thr Leu Lys Pro 



384 Tyr Lys Thr Leu Glu Leu Glu 

391 lie Ala Glu Ser Asp Val Lys 

398 Val Lys Glu Ala Glu 

403 Leu Glu Leu Val Lys Glu Glu 

410 Ala Asn Glu Ser Arg Asn Glu 
417 Glu Lys He Lys Gin Ala 

423 Lys Glu Lys Val Glu Ser Lys 

430 Lys Ala Glu Ala Thr Arg 

436 Leu Glu Lys He Lys Thr Asp 

443 Arg Lys Lys Ala Glu Glu Glu 
450 Ala Lys Arg Lys 

454 Ala Glu Glu Ser Glu Lys Lys 

461 Ala Ala Glu Ala Lys Gin Lys 

468 Val Asp Ala Glu Glu Tyr Ala 
475 Leu Glu Ala Lys 

479 He Ala Glu Leu Glu Tyr Glu 

486 Val Gin Arg Leu Glu Lys Glu 

493 Leu Lys Glu 

496 He Asp Glu Ser Asp Ser Glu 
503 Asp Tyr Leu Lys Glu Gly 

509 Leu Arg Ala 

512 Pro Leu Gin Ser Lys 

517 Leu Asp Thr Lys Lys Ala Lys 

524 Leu Ser Lys 

527 Leu Glu Glu Leu Ser Asp Lys 

534 He Asp Glu Leu Asp Ala Glu 

541 He Ala Lys Leu Glu Val Gin 

548 Leu Lys Asp Ala Glu Gly Asn 
555 Asn Asn 

557 Val Glu Ala Tyr Phe Lys Glu 
564 Gly Leu Glu Lys Thr 

569 Thr Ala Glu Lys Lys 

574 Ala Glu Leu Glu Lys Ala 

580 Glu Ala Asp Leu Lys Lys Ala 

587 Val Asp Glu 

Figure 3. The coiled-coil motif of the a-heUx of PspC. Amino acids that are not in the 
coiled-coil motif are in the right column. This is the output from the Matcher program 
accessed through the Internet (http://catt.poly.edu/-jps/). 




Clade B 



BG8090 



DBL6A 





0.05 changes 



FIGURE 5. Choline-binding Domain 



PspC consensus 



11111111112 



12345678901234567890 

1 KTGWKQENGNWYFYNTDGSMA 

2 TGWLQNNGSWYYLNAMGAMA 

3 TGWLQNNGSWYYLNANGSMA 

4 TGWLQNNGSWYYLNANGAMA 

5 TGWLQYNGSWYYLNANGDMA 

6 TGWLQYNGSVvJYYLNSNGAMA 

7 TGVJLQYNGSIAJYYLNANGDMA 

8 TGWLQNNGSWYLNANGDMA 

9 TGWLQYN SWYLNANGDMA 

10 TGWVKDGDTVJYYLEASGAMKA 

11 SQWFKVSDKWYYVNGSGALA 

VNTTVDGYGVNANGEWVN 18 amino acid tail PspC 

28% identical to consensus 

TGWLQNNGSV^FYYLNANGAMA PspC consensus repeat 



For PspA/R36A alingement see Yother and Briles 1992. 
TGWLQXNGS\AJYYLNANGAMA PspA/Rxl consensus 



VNTTVDGYKVNANGEWV_ PspA/Rxl 17 amino acid tail. 

One AA different from PspC, 
And one AA shorter. 



Y 



Y 



95% identical to PspC consensus 





PspA/EF5668 

11111111112 
12345678901234567890 

1 IGWKQENGMWYFYNTDGSM A 

2 TGWLQNNGSWYYLNSNGAM A 

3 TGWLQYNGSWYYLNANGAM A 

4 TGWLQYNGSWYYLNANGAM A 

5 TGWLQYNGSV'JYYLNANGDM A 

6 TGWLQYNGSWYYLNANGDM A 

7 TGWAKVHGSWYYLNANGSM A 

8 TGWVKDGETWYYLEASGSMKA 

9 NQWFQVSDKWYYVNGLGSL S 

10 VNTTVDGYKVNANGEWV 17 amino acid tail EF5668 PspA 

TGWLQYNGSVJYYLNANGSM A PspA/EF5668 consensus 

90% identical to PspC consensus 



Repeat #1 
PspC consensus 
PspA/Rxl 
PspA/EF5668 



KTGWKQENGNl'^YFYNTDGSMA 
TGWKQENGMWYFYNTDGSMA 
IGWKQENGMWYFYNTDGSM A 



Repeat #N-1 
PspC consensus 
PspA/Rxl 
PspA/EF5668 



TGWKDGDTWYYLEASGAMKA 
TGWVKDGDTWYYLEASGAMKA 
TGIWKDGETOTYLEASGSMKA 



Repeat #N 

PspC consensus 

PspA/Rxl 

PspA/EF5668 



SQWFKVSDKWYYVNGSGALA 
SQWFKVSDKWYYVNGLGALA 
NQWFQVSDKVJYYVNGLGSLS 



17-18 AA tail 

PspC consensus 

PspA/Rxl 

PspA/EF5668 



VNTTVDGYGVNANGEWVN 

VNTTVDGYKVNANGEWV 

VNTTVDGYKVNANGEWV 
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serum to Pspv^ 



. Reciprocal titer °f serum 




immunized 
Non-immun\zed 



^ S,— UA55 

Antigen 

of sera from five mice, 
titer is 



Other Formats: 
Links: 



LOCUS CAA05158 539 aa BCT 14-OCT-1997 

DEFINITION SpsA protein. 
ACCESSION CAA05158 
PID g2576331 

VERSION CAA05158.1 01:2576331 

DBSOURCE embl locus SPSPS A2, accession AJ002054. 1 

KEYWORDS , 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Bacillus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (residues 1 to 539) 
AUTHORS Hammerschmidt,S. 
TITLE . Direct Submission 

JOURNAL Submitted (13-OCT-1997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (residues 1 to 539) 

AUTHORS Hammerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S, 
TITLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1113-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..539 

/organism="Streptococcus pneumoniae" 
/strain="type 2" 
/db_xref="taxon:1313" 
Protein 1..539 

/function="IgA binding protein" 
/product="SpsA protein" 
CDS 1..539 

/db_xref="SPTREMBL:033741" 
/coded_by=" AJ002054. 1 : L. 1 620" 
/transljabie=ll 

ORIGIN 

1 mfaskserkv hysirkfsig vasvavaslv mgswhaten egstqaatss nmaktehrka 
61 akqwdeyie kmlreiqldr rkhtqnvaln iklsaiktky Irelnvleek skdelpseik 
121 akldaafekf kkdtlkpgek vaeakkkvee akkkaedqke edrmyptnt yktleleiae 
181 fdvkvkeael elvkeeakef megtikqak ekveskkaea trleniktdr kkaeeeakrk 
241 aaeedkvkek paeqpqpapa tqpekpapkp ekpaeqpkae ktddqqaeed yarrseeeyn 
301 rltqqqppkt ekpaqpstpk tgwkqengmw yfyntdgsma tgwlqnngsw yylnangama 
361 tgwlqnngsw yylnangsma tgwlqnngsw yylnangama tgwlqyngsw yylnsngama 



421 tgwiqyngsw yylnangdma tgwlqrmgsw yylnangdma tgwlqyngsw yylnangdma 
481 tgwvkdgdtw yyletsgamk asqwfkvsdk wyyvhgssal ainttvygyg vnangewvn 



the above report in format 



# 

Other Formats: 
Links: 

LOCUS SPSPSA2 1620 bp DNA BCT 14-OCT-1997 

DEFINITION Streptococcus pneumoniae SIgA binding. 

ACCESSION AJ002054 

NID g2576330 

VERSION AJ002054.1 01:2576330 

KEYWORDS SpsA protein. 

SOURCE Streptococcus pneumoniae. 

ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Bacillus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (bases 1 to 1620) 

AUTHORS Hammerschmidt,S. 

TITLE Direct Submission 

JOURNAL Submitted (13-OCT-1997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (bases 1 to 1620) 
AUTHORS Hammerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TITLE SpsA, a novel pneiunococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1 1 13-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..1620 

/organism=" Streptococcus pneumoniae" 
/strain="type 2" 
/db_xre^"taxon:1313" 
CDS 1..1620 

/function="IgA binding protein" 
/codon_start=l 
/transl_table=l 1 
/product="SpsA protein" 
/protein_id="CAA05 158.1" 
/db_xref="PID:e354783" 
/db_xref="PID:g2576331" 
/db_xref="GI:2576331" 
/db_xref="SPTREMBL:033741 " 

/translation="MFASKSERKVHYSIRKFSIGVASVAVASLVMGSVVHATENEGST 

QAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVALNIKLSAIKT 
KYLR 



ELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVAEAKKKVEEAKKKAE 
DQKE 

EDRRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKEFRNEGTIKQAKEKVES 
KKA 

EATRLENIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPATQPEKPAPKPEK 
PAEQ 

PKAEKTDDQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGM 
WYFYN 

TDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANGSMATGW 
LQNNGSWYY 

LNANGAMATGWLQYNGSWYYLNSNGAMATGWLQYNGSWYYLNANGDMAT 
GWLQNNGSW 

YYLNANGDMATGWLQYNGSWYYLNANGDMATGWVKDGDTWYYLETSGAM 
KASQWFKVS 

DKWYYVHGSSALAINTTVYGYGVNANGEWVN" 
repeat_region 96 1 1 564 

/rpt_type=DIRECT 
BASE COUNT 625 a 294 c 370 g 331 1 
ORIGIN 

1 atgtttgcat caaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtattgga 
61 gtagctagtg tagctgttgc cagtcttgft atgggaagtg tggttcatgc gacagagaac 
121 gagggaagta cccaagcagc cacttcttct aatatggcaa agacagaaca taggaaagct 
181 gctaaacaag tcgtcgatga atatatagaa aaaatgttga gggagattca actagataga 
241 agaaaacata cccaaaatgt cgccttaaac ataaagttga gcgcaattaa aacgaagtat 
301 ttgcgtgaat taaatgtttt agaagagaag tcgaaagatg agttgccgtc agaaataaaa 
361 gcaaagttag acgcagcttt tgagaagttt aaaaaagata cattgaaacc aggagaaaag 
421 gtagcagaag ctaagaagaa ggttgaagaa gctaagaaaa aagccgagga tcaaaaagaa 
481 gaagatcgtc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 
541 ttcgatgtga aagttaaaga agcggagctt gaactagtaa aagaggaagc taaagaattt 
601 cgaaacgagg gcacaattaa gcaagcaaaa gagaaagttg agagtaaaaa agctgaggct 
661 acaaggttag aaaacatcaa gacagatcgt aaaaaagcag aagaagaagc taaacgaaaa 
721 gcagcagaag aagataaagt taaagaaaaa ccagctgaac aaccacaacc agcgccggct 
781 actcaaccag aaaaaccagc tccaaaacca gagaagccag ctgaacaacc aaaagcagaa 
841 aaaacagatg atcaacaagc tgaagaagac tatgctcgta gatcagaaga agaatataat 
901 cgcttgactc aacagcaacc gccaaaaact gaaaaaccag cacaaccatc tactccaaaa 
961 acaggctgga aacaagaaaa cggtatgtgg tacttctaca atactgatgg ttcaatggca 
1021 acaggatggc tccaaaacaa cggttcatgg tactatctaa acgctaatgg tgctatggcg 
1081 acaggatggc tccaaaacaa tggttcatgg tactatctaa acgctaatgg ttcaatggca 
1141 acaggatggc tccaaaacaa tggttcatgg tactacctaa acgctaatgg tgctatggcg 



1201 acaggatggc tccaatacaa tggttcatgg tactacctaa acagcaatgg cgctatggcg 
1261 acaggatggc tccaatacaa tggctcatgg tactacctca acgctaatgg tgatatggcg 
1321 acaggatggc tecaaaacaa cggttcatgg tactacctca acgctaatgg tgatatggcg 
1381 acaggatggc tccaatacaa cggttcatgg tattacctca acgctaatgg tgatatggcg 
1441 acaggttggg tgaaagatgg agatacctgg tactatcttg aaacatcagg tgctatgaaa 
1501 gcaagccaat ggttcaaagt atcagataaa tggtactatg tccatggctc aagtgccctt 
1561 gcaatcaaca caactgtata tggctatgga gtcaatgcca atggtgaatg ggtaaactaa 



the above report in format 



Other Formats: 
Links: 

LOCUS CAA05159 581 aa BCT 14-OCT-1997 

DEFINITION SpsA protein. 
ACCESSION CAA05159 
PID g2576333 

VERSION CAA05 159.1 01:2576333 
DBSOURCE embl locus SPSPSA47, accession AJ002055. 1 
KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Bacillus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (residues 1 to 581) 
AUTHORS Hammerschmidt,S. 
TITLE Direct Submission 

JOURNAL Submitted (13.0CT-1997) Hammerschmidt S., Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (residues 1 to 581) 
AUTHORS Hammerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TITLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immimoglobulin A and secretory component 
JOURNAL Mol. Microbiol. 25 (6), 1113-1 124 (1997) 
MEDLINE 98010350 
FEATURES Location/Qualifiers 
source 1..581 

/organism^" Streptococcus pneumoniae" 
/strain="type 47" 
/db_xref="taxon:1313" 
Protein 1..581 

/function="IgA binding protein" 
/product="SpsA protein" 
CDS 1..581 

/db_xref^"SPTREMBL:033742" 
/coded_by=" AJ002055 . 1 : 1 .. 1 746" 
/transl_table=ll 

ORIGIN 

1 mfaskserkv hysirkfsig vasvavaslv mgswhaten egstqaatss nmaktehrka 
61 akqvvdeyie kmlreiqldr rkhtqnvaln iklsaiktky Irelnvleek skdelpseik 
121 akldaafekf kkdtlkpgek vaeakkkvee akkkaedqke edrmyptnt yktleleiae 
181 fdvkvkeael elvkeeakes megtikqak ekveskkaea trleniktdr kkaeeeakrk 
241 adaklkeanv atsdqgkpkg rakrgvpgel atpdkkenda kssdssvgee tlpssslksg 
301 kkvaeaekkv eeaekkakdq keedrmypt ntyktldlei aesdvkvkea elelvkeeak 
361 eprdeekikq akakveskka eatrlenikt drkkaeeeak rkaaeedkvk ekpaeqpqpa 




421 patqpekpap kpekpaeqpk aektddqqae edyarrseee ynrliqqqpp ktekpaqpft 

481 pktgwkqeng mwyfyntdgs matgwlqyng swyylnangd matgwvkdgd twyyleasga 

541 mkasqwfkvs dkwyyvngsg alavnttvdg ygvnangewv n 

// 




the above report in format 




Links: 

LOCUS SPSPSA47 1746 bp DNA BCT 14-001-1997 

DEFINITION Streptococcus pneumoniae SIgA binding. 

ACCESSION AJ002055 

NID g2576332 

VERSION AJ002055.1 01:2576332 

KEYWORDS SpsA protein. 

SOURCE Streptococcus pneumoniae. 

ORGANISM Streptococcus pneumoniae 

Bacteria; Firmicutes; Bacillus/Clostridium group; Streptococcaceae; 
Streptococcus. 
REFERENCE 1 (bases 1 to 1746) 

AUTHORS Hammerschmidt,S. 

TITLE Direct Submission 

JOURNAL Submitted (13-OCT-1997) Hammerschmidt S,, Microbiology, Devision 
Microbial Pathogenesis, National Research Centre for Biotechnology, 
Spielmannstrasse 7, 38106 Braunschweig, GERMANY 
REFERENCE 2 (bases 1 to 1746) 

AUTHORS Hammerschmidt,S., Talay,S.R., Brandtzaeg,P. and Chhatwal,G.S. 
TITLE SpsA, a novel pneumococcal surface protein with specific binding to 

secretory immunoglobulin A and secretory component 
JOURNAL Mol. Microbiol, 25 (6), 1113-1 124 (1997) 
MEDLINE 98010350 

FEATURES Location/Qualifiers \ 
source 1..1746 

/organism^" Streptococcus pneumoniae" 

/strain-"type 47" 

/db^xref^"taxon:1313" 
CDS L.1746 

/function="IgA binding protein" 

/codon_start=l 

/transljable=ll 

/product^" SpsA protein" 

/proteinJd-"CAA05159,l" 

/db„xref="PID:e354769" 

/db__xref-"PID:g2576333" 

/db_xref="GI:2576333" 

/db_xref="SPTREMBL:033742" 

/translation-"MFASKSERKVHYSIRKFSIGVASVAVASLVMGSVVHATENEGST 



QAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRKHTQNVALNIKLSAIKT 
KYLR 



ELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKPGEKVAEAKKKVEEAKKKAE 
DQKE 

EDRRNYPTNTYKTLELEIAEFDVKVKEAELELVKEEAKESRNEGTIKQAKEKVES 
KKA 

EATRLENIKTDRKKAEEEAKRKADAKLKEANVATSDQGKPKGRAKRGVPGELA 
TPDKK 

ENDAKSSDSSVGEETLPSSSLKSGKKVAEAEKKVEEAEKKAKDQKEEDRRNYPT 
NTYK 

TLDLEIAESDVKVKEAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLENIK 
TDR 

KKAEEEAKRKAAEEDKVKEKPAEQPQPAPATQPEKPAPKPEKPAEQPKAEKTDD 
QQAE 

EDYARRSEEEYNRLIQQQPPKTEKPAQPFTPKTGWKQENGMWYFYNTDGSMAT 
GWLQY 

NGSWYYLNANGDMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVN 
GSGALAVNT 

TVDGYGVNANGEWVN" 
repeat_region 1447.. 1690 

/rpt_type=DIRECT 
BASE COUNT 722 a 290 c 408 g 326 1 
ORIGIN 

1 atgtttgcat caaaaagcga aagaaaagta cattattcaa ttcgtaaatt tagtattgga 
61 gtagctagtg tagctgttgc cagtcttgtt atgggaagtg tggttcatgc gacagagaac 
121 gagggaagta cccaagcagc cacttcttct aatatggcaa agacagaaca taggaaagct 
181 gctaaacaag tcgtcgatga atatatagaa aaaatgttga gggagattca actagataga 
241 agaaaacata cccaaaatgt cgccttaaac ataaagttga gcgcaattaa aacgaagtat 
301 ttgcgtgaat taaatgtttt agaagagaag tcgaaagatg agttgccgtc agaaataaaa 
361 gcaaagttag acgcagcttt tgagaagttt aaaaaagata cattgaaacc aggagaaaag 
421 gtagcagaag ctaagaagaa ggttgaagaa gctaagaaaa aagccgagga tcaaaaagaa 
481 gaagatcgtc gtaactaccc aaccaatact tacaaaacgc ttgaacttga aattgctgag 
541 ttcgatgtga aagttaaaga agcggagctt gaactagtaa aagaggaagc taaagaatct 
601 cgaaacgagg gcacaattaa gcaagcaaaa gagaaagttg agagtaaaaa agctgaggct 
661 acaaggttag aaaacatcaa gacagatcgt aaaaaagcag aagaagaagc taaacgaaaa 
721 gcagatgcta agttgaagga agctaatgta gcgacttcag atcaaggtaa accaaagggg 
781 cgggcaaaac gaggagttcc tggagagcta gcaacacctg ataaaaaaga aaatgatgcg 
841 aagtcttcag attctagcgt aggtgaagaa actcttccaa gctcatccct gaaatcagga 
901 aaaaaggtag cagaagctga gaagaaggtt gaagaagctg agaaaaaagc caaggatcaa' 
961 aaagaagaag atcgccgtaa ttacccaacc aatacttaca aaacgcttga ccttgaaatt 



1021 gctgagtccg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaag 
1081 gaacctcgag acgaggaaaa aattaagcaa gcaaaagcga aagttgagag taaaaaagct 
1141 gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 
1201 cgaaaagcag cagaagaaga taaagttaaa gaaaaaccag ctgaacaacc acaaccagcg 
1261 ccggctactc aaccagaaaa accagctcca aaaccagaga agccagctga acaaccaaaa 
1321 gcagaaaaaa cagatgatca acaagctgaa gaagactatg ctcgtagatc agaagaagaa 
1381 tataatcgct tgattcaaca gcaaccgcca aaaactgaaa aaccagcaca accatttact 
1441 ccaaaaacag gctggaaaca agaaaacggt atgtggtact tctacaatac tgatggttca 
1501 atggcaacag gatggctcca atacaacggt tcatggtatt acctcaacgc taatggtgat 
1561 atggcgacag gttgggtgaa agatggagat acctggtact atcttgaagc atcaggtgct 
1621 atgaaagcaa gccaatggtt caaagtatca gataaatggt actatgtcaa tggctcaggt 
1681 gcccttgcag tcaacacaac tgtagatggc tatggagtca atgccaatgg tgaatgggta 
1741 aactaa 



// 




the above report in format 



ther Formats: 
Links: 



LOCUS AAB70838 663 aa BCT 16-SEP-1997 

DEFINITION choline binding protein A. 
ACCESSION AAB70838 
PID g2425109 

VERSION AAB70838.1 01:2425109 
DBSOURCE locus AF019904 accession AF019904.1 
KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 

Streptococcaceae; Streptococcus. 
REFERENCE 1 (residues 1 to 663) 

AUTHORS Rosenow,C., Ryan,?., WeiserJ.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Contribution of novel choline-binding proteins to adherence, 

colonization and immunogenicity of Streptococcus pneumoniae 
JOURNAL Mol. Microbiol. (1 997) In press 
REFERENCE 2 (residues 1 to 663) 

AUTHORS Rosenow,C., Ryan,P., WeiserJ.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Direct Submission 

JOURNAL Submitted (18-AUG-1997) Infectious Diseases, St. Jude Children's 

Research Hospital, 322 North Lauderdale, Memphis, TN 38105, USA 
COMMENT Method: conceptual translation supplied by author. 
FEATURES Location/Qualifiers 
source 1..663 

/organism=" Streptococcus pneumoniae" 
/strain="R6x" 
/db_xref="taxon:1313" 
Protein <1..663 

/product="choline binding protein A" 
/name="CbpA" 
CDS l.,663 

/gene="cbpA" 

/coded_by="AF019904.1:<1..1992" 
/transl_table=ll 

ORIGIN 

1 enegstqaat ssnmaktehr kaakqwdey iekmlreiql drrkhtqnva Iniklsaikt 
61 kylrelnvle ekskdelpse ikakldaafe kfkkdtlkpg ekvaeakkkv eeakkkaedq 
121 keedrmypt ntyktlelei aefdvkvkea elelvkeeak esmegtikq akekveskka 
181 eatrlenikt drkkaeeeak rkadaklkea nvatsdqgkp kgrakrgvpg elatpdkken 
241 dakssdssvg eetlpssslk sgkkvaeaek kveeaekkak dqkeedrmy ptntyktldl 
301 eiaesdvkvk eaelelvkee akeprdeeki kqakakvesk kaeatrleni ktdrkkaeee 



361 akrkaaeedk vkekpaeqpq papatqpekp apkpekpaeq pkaektddqq aeedyarrse 
421 eeynrltqqq ppktekpaqp stpktgwkqe ngmwyfyntd gsmatgwlqn ngswyylnan 
481 gamatgwlqn ngswyylnan gsmatgwlqn ngswyylnan gamatgwlqy ngswyylnsn 
541 gamatgwlqy ngswyylnan gdmatgwlqn ngswyylnan gdmatgwiqy ngswyylnan 
601 gdmatgwvkd gdtwyyleas gamkasqwfk vsdkwyyvng sgalavnttv dgygvnange 
661 wvn 



the above report in format 



Other Formats: 
Links: 

LOCUS AFO 19904 2480 bp DNA BCT 22-SEP-1997 

DEFINITION Streptococcus pneumoniae choline binding protein A (cbpA) gene, 

partial cds. 
ACCESSION AFO 19904 
NID g2425108 
VERSION AF019904.1 GI:2425108 
KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 
Streptococcaceae; Streptococcus. 
REFERENCE 1 (bases 1 to 2480) 
AUTHORS Rosenow,C., Ryan,P., WeiserJ.N., Johnson,S., Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Contribution of novel choline-binding proteins to adherence, 
colonization and immvmogenicity of Streptococcus pneumoniae 
JOURNAL Mol. Microbiol. (1997) In press 
REFERENCE 2 (bases 1 to 2480) 
AUTHORS Rosenow,C., Ryan,P., Weiser,J.N., Johnson,S^ Fontan,P., 

Ortqvist,A. and Masure,H.R. 
TITLE Direct Submission 

JOURNAL Submitted ( 1 8- AUG- 1 997) Infectious Diseases, St. Jude Children's 
Research Hospital, 322 North Lauderdale, Memphis, TN 38105, USA , 
FEATURES Location/Qualifiers 
source 1..2480 

/organism="Streptococcus pneumoniae" 
/strain="R6x" 
/db_xre^"taxon:1313" 
gene <1..1992 

/gene="cbpA" 
CDS <1..1992 
/gene="cbpA" 
/note="CbpA" 
/codon_start=l 
/transl_table=l 1 

/product="choline binding protein A" 
/protein_id=" AAB70838. 1 " 
/db_xref^"PID:g2425109" 
/db_xre^"GI:2425109" 



/translation="ENEGSTQAATSSNMAKTEHRKAAKQVVDEYIEKMLREIQLDRRK 



HTQNVALNIKLSAIKTKYLRELNVLEEKSKDELPSEIKAKLDAAFEKFKKDTLKP 
GEK 

VAEAKKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAEFDVKVKEAELELV 
KEEAK 

ESRNEGTIKQAKEKVESKKAEATRLENIKTDRKKAEEEAKRKADAKLKEANVAT 
SDQG 

KPKGRAKRGVPGELATPDKKENDAKSSDSSVGEETLPSSSLKSGKKVAEAEKKV 
EEAE 

KKAKDQKEEDRRNYPTNTYKTLDLEIAESDVKVKEAELELVKEEAKEPRDEEKI 
KQAK 

AKVESKKAEATRLENIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPATQPE 
KPAP 

KPEKPAEQPKAEKTDDQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTG 
WKQE 

NGMWYFYNTDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNA 
NGSMATGWL 

QNNGSWYYLNANGAMATGWLQYNGSWYYLNSNGAMATGWLQYNGSWYYL 
NANGDMATG 

WLQNNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWVKDGDTWY 
YLEASGAMK 

ASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN" . 
BASE COUNT 933 a 452 c 575 g 520 1 
ORIGIN 

1 gaaaacgaag gaagtaccca agcagccact tcttctaata tggcaaagac agaacatagg 
61 aaagctgcta aacaagtcgt cgatgaatat atagaaaaaa tgttgaggga gattcaacta 
121 gatagaagaa aacataccca aaatgtcgcc ttaaacataa agttgagcgc aattaaaacg 
181 aagtatttgc gtgaattaaa tgttttagaa gagaagtcga aagatgagtt gccgtcagaa 
241 ataaaagcaa agttagacgc agcttttgag aagtttaaaa aagatacatt gaaaccagga 
301 gaaaaggtag cagaagctaa gaagaaggtt gaagaagcta agaaaaaagc cgaggatcaa 
361 aaagaagaag atcgtcgtaa ctacccaacc aatacttaca aaacgcttga acttgaaatt 
421 gctgagttcg atgtgaaagt taaagaagcg gagcttgaac tagtaaaaga ggaagctaaa 
481 gaatctcgaa acgagggcac aattaagcaa gcaaaagaga aagttgagag taaaaaagct 
541 gaggctacaa ggttagaaaa catcaagaca gatcgtaaaa aagcagaaga agaagctaaa 
601 cgaaaagcag atgctaagtt gaaggaagct aatgtagcga cttcagatca aggtaaacca 
661 aaggggcggg caaaacgagg agttcctgga gagctagcaa cacctgataa aaaagaaaat 
721 gatgcgaagt cttcagattc tagcgtaggt gaagaaactc ttccaagctc atccctgaaa 




781 tcaggaaaaa aggtagcaga agctgagaag aaggttgaag aagctgagaa aaaagccaag 
841 gatcaaaaag aagaagatcg ccgtaactac ccaaccaata cttacaaaac gcttgacctt 
901 gaaattgctg agtccgatgt gaaagttaaa gaagcggagc ttgaactagt aaaagaggaa 
961 gctaaggaac ctcgagacga ggaaaaaatt aagcaagcaa aagcgaaagt tgagagtaaa 
1021 aaagctgagg ctacaaggtt agaaaacatc aagacagatc gtaaaaaagc agaagaagaa 
1081 gctaaacgaa aagcagcaga agaagataaa gttaaagaaa aaccagctga acaaccacaa 
1141 ccagcgccgg ctactcaacc agaaaaacca gctccaaaac cagagaagcc agctgaacaa 
1201 ccaaaagcag aaaaaacaga tgatcaacaa gctgaagaag actatgctcg tagatcagaa 
1261 gaagaatata atcgcttgac tcaacagcaa ccgccaaaaa ctgaaaaacc agcacaacca 
1321 tctactccaa aaacaggctg gaaacaagaa aacggtatgt ggtacttcta caatactgat 
1381 ggttcaatgg caacaggatg gctccaaaac aacggttcat ggtactatct aaacgctaat 
1441 ggtgctatgg cgacaggatg gctccaaaac aatggttcat ggtactatct aaacgctaat 
1501 ggttcaatgg caacaggatg gctccaaaac aatggttcat ggtactacct aaacgctaat 
1561 ggtgctatgg cgacaggatg gctccaatac aatggttcat ggtactacct aaacagcaat 
1621 ggcgctatgg cgacaggatg gctccaatac aatggctcat ggtactacct caacgctaat 
1681 ggtgatatgg cgacaggatg gctccaaaac aacggttcat ggtactacct caacgctaat 
1741 ggtgatatgg cgacaggatg gctccaatac aacggttcat ggtattacct caacgctaat 
1801 ggtgatatgg cgacaggttg ggtgaaagat ggagatacct ggtactatct tgaagcatca 
1861 ggtgctatga aagcaagcca atggttcaaa gtatcagata aatggtacta tgtcaatggc 
1921 tcaggtgccc ttgcagtcaa cacaactgta gatggctatg gagtcaatgc caatggtgaa 
1981 tgggtaaact aaacctaata taactagtta atactgactt cctgtaagaa ctttttaaag 
2041 tattccctac aaataccata tcctttcagt agataatata cccttgtagg aagtttagat 
2101 taaaaaataa ctctgtaatc tctagccgga tttatagcgc tagagactac ggagtttttt 
2161 tgatgaggaa agaatggcgg cattcaagag actctttaag agagttacgg gttttaaact 
2221 attaagcctt ctccaattgc aagaggcttc aatctctgct agggtgctag cttgcgaaat 
2281 ggctccacgg agtttggcag cgccagatgt tccacggaga tagtgaggag cgaggccgcg 
2341 gaattcacga actgcgacgt tttctccttt gaggttaatc aatcgtttca agtgttcgta 
2401 ggcgatcttc atcttgtctt caaaggtcaa atcaggtagg atttctcctg tttcaaagtt 
2461 tatggtggcc ctggttgaag 



// 



the above report in format 



Other Formats: 
Links: 

LOCUS AAD00184 929 aa BCT 07-OCT-1996 

DEFINITION surface protein C. 
ACCESSION AAD00184 
PID g4097980 

VERSION AAD00184.1 01:4097980 
DBSOURCE locus SPU72655 accession U72655. 1 
KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 

Streptococcaceae; Streptococcus. 
REFERENCE 1 (residues 1 to 929) 

AUTHORS Brooks- Walter^A., Tart,R.C., Briles,D.E. and Hoilingshead,S.K. 
TITLE The pspC gene encodes a second pneumococcal surface protein 

homologous to the protection-eliciting PspA protein of 

Streptococcus pneumoniae 
JOURNAL Unpublished 
REFERENCE 2 (residues 1 to 929) 

AUTHORS Brooks-Walter,A., Tart,R.C., Briles,D.E. and Hollingshead,S.K. 
TITLE Direct Submission 

JOURNAL Submitted (26-SEP-1996) Microbiology, University of Alabama at 
Birmingham, 19th Street South, Box 10, Birmingham, AL 35294-2170, 
USA 

FEATURES Location/Qualifiers 
source 1..929 

/organism=" Streptococcus pneumoniae" 

/strain="EF6796" 

/db_xref="taxon:1313" 
Protein 1..929 

/product="surface protein C" 
CDS 1..929 

/gene="pspC" 

/coded_by="U72655.1:319..3108" 
/transljable=ll 

ORIGIN 

1 mfaskserkv hysirkfsig vasvavaslf Iggwhaegv rsgnnltvts sgqdiskkya 
61 deveshlesi Ikdvkknlkk vqhtqnvgli tklseikkky lydlkvnvls eaeltsktke 
121 tkekltatfe qfkkdtlpte pekkvaeaqk kveeakkkae dqkekdrmy ptityktlel 
181 eiaesdvevk kaelelvkvk akesqdeeki kqaeaevesk qaeatrlkki ktdreeakrk 
241 adaklkeave knvatseqdk pkrrakrgvs gelatpdkke ndakssdssv geetlpspsl 
301 nmanesqteh rkdvdeyikk mlseiqldrr khtqnvnini klsaiktkyl yelsvlkens 
361 kkeeltsktk aeltaafeqf kkdtlkpekk vaeaekkvee akkkakdqke edrmyptnt 
421 yktleleiae sdvkvkeael elvkeeanes meekikqak ekveskkaea trlekiktdr 




481 kkaeeeakrk aeesekkaae akqkvdaeey aleakiaele yevqrlekel keidesdsed 
541 ylkeglrapl qskldtkkak Iskleelsdk ideldaeiak levqlkdaeg mmveayfke 
601 glekttaekk aelekaeadl kkavdepetp apapqpapap ekpaekpapa pekpapapek 
661 papapekpap apekpapape kpaptpetpk tgwkqengmw yfyntdgsma tgwlqnngsw 
721 yylnsngama tgwlqnngsw yylnsngama tgwlqyngsw yylnangdma tgwlqyngsw 
781 yylnangdma tgwfqyngsw yylnangdma tgwfqyngsw yylnangdma tgwlqyngsw 
841 yylnsngamv tgwlqnngsw yylnangsma tdwvkdgdtw yyleasgamk asqwfkvsdk 
901 wyyvngsgal avnttvdsyr vnangewvn 



// 



the above report in format 




LOCUS SPU72655 3463 bp DNA BCT 02-JAN-1999 

DEFINITION Streptococcus pneumoniae surface protein C (pspC) gene, complete 
cds. 

ACCESSION U72655 

NID g4097979 

VERSION U72655.1 01:4097979 

KEYWORDS . 

SOURCE Streptococcus pneumoniae. 
ORGANISM Streptococcus pneumoniae 

Eubacteria; Firmicutes; Low G+C gram-positive bacteria; 

Streptococcaceae; Streptococcus. 
REFERENCE 1 (bases 1 to 3463) 

AUTHORS Brooks- WaIter,A., Tart,R.C., BriIes,D.E. and Hollingshead,S.K. 
TITLE The pspC gene encodes a second pneumococcal surface protein 
homologous to the protection-eliciting PspA protein of 
Streptococcus pneumoniae 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 3463) 
AUTHORS Brooks- Waher,A-, Tart,R.C., Briles,D.E. and Hollingshead,S.K. 
TITLE Direct Submission 

JOURNAL Submitted (26-SEP- 1 996) Microbiology, University of Alabama at 
Birmingham, 19th Street South, Box 10, Birmingham, AL 35294-2170, 
USA 

FEATURES Location/Qualifiers 
source 1..3463 

/organism=" Streptococcus pneumoniae" 

/strain="EF6796" 

/db_xref="taxon:1313" 
gene 319..3108 

/gene="pspC" 
CDS 319..3108 

/gene="pspC" 

/codon_start=l 

/transl_table=l 1 

/product="surface protein C" 

/protein_id=" AADOO 1 84. 1 " 

/db_xref="PID:g4097980" 

/db_xref^"GI:4097980" 

/translation="MFASKSERKVHYSIRKFSIGVASVAVASLFLGGVVHAEGVRSGN 

NLTVTSSGQDISKKYADEVESHLESILKDVKKNLKKVQHTQNVGLITKLSEIKKK 
YLY 



Other Formats: 
Links: 




DLKVNVLSEAELTSKTKETKEKLTATFEQFKKDTLPTEPEKKVAEAQKKVEEAK 
KKAE 

DQKEKDRRNYPTITYKTLELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEA 
EVE 

SKQAEATRLKKIKTDREEAKRKADAKLKEAVEKNVATSEQDKPKRRAKRGVSG 
ELATP 

DKKENDAKSSDSSVGEETLPSPSLNMANESQTEHRKDVDEYIKKMLSEIQLDRR 
KHTQ 

NVNLNIKLSAIKTKYLYELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEK 
KVA 

EAEKKVEEAKKKAKDQKEEDRRNYPTNTYKTLELEIAESDVKVKEAELELVKEE 
ANES 

RNEEKIKQAKEKVESKKAEATRLEKIKTDRKKAEEEAKRKAEESEKKAAEAKQK 
VDAE 

EYALEAKIAELEYEVQRLEKELKEIDESDSEDYLKEGLRAPLQSKLDTKKAKLSK 
LEE 

LSDKIDELDAEIAKLEVQLKDAEGNNNVEAYFKEGLEKTTAEKKAELEKAEADL 
KKAV 

DEPETPAPAPQPAPAPEKPAEKPAPAPEKPAPAPEKPAPAPEKPAPAPEKPAPAPE 
KP 

APTPETPKTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATG 
WLQNNGSW 

YYLNSNGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDM 
ATGWFQYNG 

SWYYLNANGDMATGWFQYNGSWYYLNANGDMATGWLQYNGSWYYLNSNG 
AMVTGWLQN 

NGSWYYLNANGSMATDWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNG 
SGALAVNT 

TVDSYRVNANGEWVN" 
BASE COUNT 1326 a 591c 755 g 790 1 1 others 
ORIGIN 

1 aagcttatgc ttgtcaataa tcacaaatat gtagatcata tcttgtttag gacagtaaaa 



61 catcctaatt actttttaaa tattttacct gagttgattg gcttgacctt gttgagtcat 
121 gcctatatga cttttgtttt agtttttcca gtttatgcag ttattttgta tcgacgaata 
181 gctgaagagg aaaagttatt acatgaagtt ataatcccaa atggaagcat aaagagataa 
241 atacaaaatt cgatttatat acagttcata ttgaagtgat atagtaaggt taaagaaaaa 
301 atatagaagg aaataaacat gtttgcatca aaaagcgaaa gaaaagtaca ttattcaatt 
361 cgtaaattta gtattggagt agctagtgta gctgttgcca gcttgttctt aggaggagta 
421 gtccatgcag aaggggttag aagtgggaat aacctcacgg ttacatctag tgggcaagat 
481 atatcgaaga agtatgctga tgaagtcgag tcgcatctag aaagtatatt gaaggatgtc 
541 aaaaaaaatt tgaaaaaagt tcaacatacc caaaatgtcg gcttaattac aaagttgagc 
601 gaaattaaaa agaagtattt gtatgactta aaagttaatg ttttatcgga agctgagttg 
661 acgtcaaaaa caaaagaaac aaaagaaaag ttaaccgcaa cttttgagca gtttaaaaaa 
721 gatacattac caacagaacc agaaaaaaag gtagcagaag ctcagaagaa ggttgaagaa 
781 gctaagaaaa aagccgagga tcaaaaagaa aaagatcgcc gtaactaccc aaccattact 
841 tacaaaacgc ttgaacttga aattgctgag tccgatgtgg aagttaaaaa agcggagctt 
901 gaactagtaa aagtgaaagc taaggaatct caagacgagg aaaaaattaa gcaagcagaa 
961 gcggaagttg agagtaaaca agctgaggct acaaggttaa aaaaaatcaa gacagatcgt 
1021 gaagaagcta aacgaaaagc agatgctaag ttgaaggaag ctgttgaaaa gaatgtagcg 
1081 acttcagagc aagataaacc aaagaggcgg gcaaaacgag gagtttctgg agagctagca 
1141 acacctgata aaaaagaaaa tgatgcgaag tcttcagatt ctagcgtagg tgaagaaact 
1201 cttccaagcc catcccttaa tatggcaaat gaaagtcaga cagaacatag gaaagatgtc 
1261 gatgaatata taaaaaaaat gttgagtgag atccaattag atagaagaaa acatacccaa 
1321 aatgtcaact taaacataaa gttgagcgca attaaaacga agtatttgta tgaattaagt 
1381 gttttaaaag agaactcgaa aaaagaagag ttgacgtcaa aaaccaaagc agagttaacc 
1441 gcagcttttg agcagtttaa aaaagataca ttgaaaccag aaaaaaaggt agcagaagct 
1501 gagaagaagg ttgaagaagc taagaaaaaa gccaaggatc aaaaagaaga agatcgccgt 
1561 aactacccaa ccaatactta caaaacgctt gaacttgaaa ttgctgagtc cgatgtgaaa 
1621 gttaaagaag cggagcttga actagtaaaa gaggaagcta acgaatctcg aaacgaggaa 
1681 aaaattaagc aagcaaaaga gaaagttgag agtaaaaaag ctgaggctac aaggttagaa 
1741 aaaatcaaga cagatcgtaa aaaagcagaa gaagaagcta aacgaaaagc agaagaatct 
1801 gagaaaaaag ctgctgaagc caaacaaaaa gtggatgctg aagaatatgc tcttgaagct 
1861 aaaatcgctg agttggaata tgaagttcag agactagaaa aagagctcaa agagattgat 
1921 gagtctgact cagaagatta tcttaaagaa ggcctccgtg ctcctcttca atctaaattg 
1981 gataccaaaa aagctaaact atcaaaactt gaagagttga gtgataagat tgatgagtta 
2041 gacgctgaaa ttgcaaaact tgaagttcaa cttaaagatg ctgaaggaaa caataatgta 
2101 gaagcctact ttaaagaagg tttagagaaa actactgctg agaaaaaagc tgaattagaa 
2161 aaagctgaag ctgaccttaa gaaagcagtt gatgagccag aaactccagc tccggctcct 
2221 caaccagctc cagctccaga aaaaccagct gaaaaaccag ctccagctcc agaaaaacca 
2281 gctccagctc cagaaaaacc agctccagct ccagaaaaac cagctccagc tccagaaaaa 
2341 ccagctccag ctccagaaaa accagctcca actccagaaa ctccaaaaac aggctggaaa 
2401 caagaaaacg gtatgtggta cttctacaat actgatggtt caatggcaac aggctggctc 
2461 caaaacaatg gctcatggta ctacctcaac agcaatggcg ctatggcgac aggatggctc 
2521 caaaacaatg gctcatggta ctacctcaac agcaatggcg ctatggcgac aggatggctc 
2581 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggctc 
2641 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggttc 
2701 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggttc 
2761 caatacaatg gttcatggta ctacctcaac gctaatggtg atatggcgac aggatggctc 



2821 caatacaatg gttcatggta ctacctaaac agcaatggtg ctatggtaac aggatggctc 
2881 caaaacaatg gctcatggta ctacctaaac gctaacggtt caatggcaac agattgggtg 
2941 aaagatggag atacctggta ctatcttgaa gcatcaggtg ctatgaaagc aagccaatgg 
3001 ttcaaagtat cagataaatg gtactatgtc aatggctcag gtgcccttgc agtcaacaca 
3061 actgtagata gctatagagt caatgccaat ggtgaatggg taaactaaac ttaatataac 
3121 tagttaatac tgacttcctg taagaactct ttaaagtatt ccctacaaat accatatcct 
3181 ttcagtagat aatataccct tgtaggaagt ttagattaaa aaataactct gtaatctcta 
3241 gccggattta tagcgctaga gactacggag tttttttgat gaggaaagaa tggcggcatt 
3301 caagagactc tttaagagag ttacgggttt taaactatta agctttctcc aattgcaaga 
3361 gggcttcaat ctctgctagg tgctagcttg cgaaatggct cccacggagt ttggcrgcgc 
3421 cagatgttcc acggaggtag tgaggagcga ggccgcggaa ttc 



the above report in format 
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LOCUS E134.DNA 2405 BP SS-DNA SYN MBlMW 

DEFINITION - 
ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 187.2405 
BASE COUNT 1046 A 386 C 505 G 468 T 0 OTHER 
ORIGIN - 

1 CGGCCGCCAG TGTGATGGAT ATCTGCAGAA TTCGCCCTTC GACGAATAGC TGAAGAGGAA 
61 AAGCTATTAC ATGAAGTTAT AATCCCAAAT GGAAGCATAA AGAGATAAAT ACAAAATTCG 
121 ATTTATATAC AGTTCATATT GAAGTGATAT AGTAAGGTTA AAGAAAAAAT ATAGAAGGAA 
181 ATAAACATGT TTGCATCAAA AAGCGAAAGA AAAGTACATT ATTCAATTCG TAAATTTAGT 
241 ATTGGAGTAG CTAGTGTAGT AGTTGCTAGT CTTGTTATGG GAAGTGTGGT TCATGCGACG 
301 GAGAATGAGG GAATTACCCA AGTAGCCACT TCTTATAATA AGGCAAATGA AAGTCAGACA 
361 GAACATAGGA AAGCTGCTAA ACAAGTCGAT GAAGATATAA AAAAAATGTT GAGTGAGATC 
421 CAAGAATATA TAAAAAAAAT GTTGAGTGAG ATCCAATTAG ATAAAAGAAA ACATACCCAA 
481 AATGTCAACT TAAACAGAAA GTTGAGCGCA ATTCAAACGA AGTATTTGTA TGAATTAAGA 
541 GTTTTAAAAG AGAAGTCGAA AAAAGAAGAG TTGACGTCAA AAACAAAAAA AGAGTTAGAC 
601 GCAGCTTTTG AGAAGTTTAA AAAAGAACCA GAACTTACTA AAAAATTAGC AGAAGCTAAA 
661 CAAAAAGCCA AGGCTCAAAA AGAAGAAGAT TTCCGTAACT ACCCAACCAA TACTTACAAA 
721 ACGCTTGAAC TTGAAATTGC TGAGTTCGAT GTGAAAGTTA AAGAAGCGGA GCTTGAACTA 
781 GTAAAAGAGG AAGCTAAACC CCGAAACGAG GAAAAAATTA AGCAAGCAAA AGCGAAAGTT 
841 GAGAGTAAAA AAGCTGAGGC TACAAGGTTA GAAGAAATCA AGACAGAACG TAAAAAAGCA 
901 GAAGAAGAAG CTAAACGAAA AGCAGAAGAA TCTGAGAAAA AAGCTGCTGA AGCCAAACAA 
961 AAAGTGGATA CTAAAGAGCA AGGTAAACCA AAGAGGCGGG CAAAACGAGG 
AGTTTCTGGA 

1021 GAGCTAGCAA CACCTGATAA AAAAGAAAAT GATGCGAAGT CTTCAGATTC TAGCGTAGGT 
1081 GAAGAAACTC TTCCAAGCCC ATCCCTTAAT ATGGCAAATG AAAGTCAGAC AGAACATAGG 
1 141 AAAGATGTCG ATGAATATAT AAAAAAAATG TTGAGTGAGA TCCAATTAGA TAGAAGAAAA 
1201 CATACCCAAA ATGTCAACTT AAACATAAAG TTGAGCGCAA TTAAAACGAA GTATTTGTAT 
1261 GAATTAAGTG TTTTAAAAGA GAACTCGAAA AAAGAAGAGT TGACGTCAAA AACCAAAGCA 
1321 GAGTTAACCG CAGCTTTTGA GCAGTTTAAA AAAGATACAT TGAAACCAGA AAAAAAGGTA 
1381 GCAGAAGCTG AGAAGAAGGT TGAAGAAGCT AAGAAAAAAG CCAAGGATCA 
AAAAGAAGAA 

1441 GATCGCCGTA ACTACCCAAC CAATACTTAC AAAACGCTTG AACTTGAAAT TGCTGAGTCC 
1501 GATGTGAAAG TTAAAAAAGC GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA CGAATCTCGA 
1561 AACGAGGAAA AAATTAAGCA AGCAAAAGAG AAAGTTGAGA GTAAAAAAGC TGAGGCTACA 
1621 AGGTTAGAAA AAATCAAGAC AGATCGTAAA AAAGCAGAAG AAGAAGCTAA ACGAAAAGCA 
1681 GAAGAATCTG AGAAAAAAGC TGCTGAAGCC AAACAAAAAG TGGATGCTGA AGAATATGCT 
1741 CTTGAAGCTA AAATCGCTGA GTTGGAATAT GAAGTTCAGA GACTAGAAAA AGAGCTCAAA 
1801 GAGATTGATG AGTCTGACTC AGAAGATTAT CTTAAAGAAG GCCTCCGTGC TCCTCTTCAA 
1861 TCTAAATTGG ATACCAAAAA AGCTAAACTA TCAAAACTTG AAGAGTTGAG TGATAAGATT 
1921 GATGAGTTAG ACGCTGAAAT TGCAAAACTT GAAGTTCAAC TTAAAGATGC TGAAGGAAAC 
1981 AATAATGTAG AAGCCTACTT TAAAGAAGGT TTAGAGAAAA CTACTGCTGA GAAAAAAGCT 
2041 GAATTAGAAA AAGCTGAAGC TGACCTTAAG AAAGCAGTTG ATGAGCCAGA AACTCCAGCT 
2101 CCGGCTCCTC AACCAGCTCC AGCTCCAGAA AAACCAGCTG AAAAACCAGC 
TCCAGCTCCA 

2161 GCTCCAGAAA AACCAGCTCC AGCTCCAGAA AAACCAGCTG AAAAACCAGC 
TGAAAAACCA 

2221 GCTGAAGAAC CAGCTGAAAA ACCAGCTCCA GCTCCAGAAA AACCAGCTCC 
AACTCCAGAA 

2281 AAACCAGCTG CAACTCCAGA AACTCCAAAA ACAGGCTGGA AACAAGAAAA 
CGGTATGTGG 
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2341 TACTTCTACA ATACTGATGG TTCAATGGCA ACAGGCTGGC TCCAAAACAA TGGTTCATGG 
2401 TACTA 

// 
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DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE - 
FEATURES 



LOCUS 



D39.DNA 



2774 BP SS-DNA 



SYN 



Location/Qualifiers 
362..2467 



BASE COUNT 1057 A 476 C 615 G 624 T 2 OTHER 
ORIGIN - 

1 CCAAGCTATT AGGTGACACT ATAGAATACT CAAGCTATGC ATCAAGCTTA TGCTTGTCAA 
61 TAATCACAAA TATGTAGATC ATATCTTGTT TAGGACAGTA AAACATCCTA ATTAC I I I I I 
121 AAATATTCTT CCTGAGTTGA TTGGCTTGAC CTTGTTGAGT CATGCTTATG TGACTTTTGT 
181 TTTAG I I I I I CCAGTTTATG CAGTTATTTT GTATCGACGA ATAGCTGAAG AGGAAAAGCT 
241 ATTACATGAA GTTATAATCC CAAATGGAAG CATAAAGAGA TAAATACAAA ATTCGATTTA 
301 TATACAGTTC ATATTGAAGT AATATAGTAA GGTTAAAGAA AAAATATAGA AGGAAATAAA 
361 CATGTTTGCA TCAAAAAGCG AAAGAAAAGT ACATTATTCA ATTCGTAAAT TTAGTATTGG 
421 AGTAGCTAGT GTAGCTGTTG CCAGTCTTGT TATGGGAAGT GTGGTTCATG CGACAGAGAA 
481 CGAGGGAAGT ACCCAAGCAG CCACTTCTTC TAATATGGCA AAGACAGAAC ATAGGAAAGC 
541 TGCTAAACAA GTCGTCGATG AATATATAGA AAAAATGTTG AGGGAGATTC AACTAGATAG 
601 AAGAAAACAT ACCCAAAATG TCGCCTTAAA CATAAAGTTG AGCGCAATTA AAACGAAGTA 
661 TTTGCGTGAA TTAAATGTTT TAGAAGAGAA GTCGAAAGAT GAGTTGCCGT CAGAAATAAA 
721 AGCAAAGTTA GACGCAGCTT TTGAGAAGTT TAAAAAAGAT ACATTGAAAC CAGGAGAAAA 
781 GGTAGCAGAA GCTAAGAAGA AGGTTGAAGA AGCTAAGAAA AAAGCCGAGG ATCAAAAAGA 
841 AGAAGATCGT CGTAACTACC CAACCAATAC TTACAAAACG CTTGAACTTG AAATTGCTGA 
901 GTTCGATGTG AAAGTTAAAG AAGCGGAGCT TGAACTAGTA AAAGAGGAAG CTAAAGAATC 
961 TCGAAACGAG GGCACAATTA AGCAAGCAAA AGAGAAAGTT GAGAGTAAAA AAGCTGAGGC 
1021 TACAAGGTTA GAAAACATCA AGACAGATCG TAAAAAAGCA GAAGAAGAAG CTAAACGAAA 
1081 AGCAGATGGT AAGTTGAAGG AAGCTAATGT AGCGACTTCA GATCAAGGTA 
AACCAAAGGG 

1 141 GCGGGCAAAA CGAGGAGTTC CTGGAGAGCT AGCAACACCT GATAAAAAAG 
AAAATGATGC 

1201 GAAGTCTTCA GATTCTAGCG TAGGTGAAGA AACTCTTCCA AGCTCATCCC TGAAATCAGG 
1261 AAAAAAGGTA GCAGAAGCTG AGAAGAAGGT TGAAGAAGCT GAGAAAAAAG 
CCAAGGATCA 

1321 AAAAGAAGAA GATCGCCGTA ACTACCCAAC CAATACTTAC AAAACGCTTG ACCTTGAAAT 
1381 TGCTGAGTCC GATGTGAAAG TTAAAGAAGC GGAGCTTGAA CTAGTAAAAG 
AGGAAGCTAA 

1441 GGAACCTCGA GACGAGGAAA AAATTAAGCA AGCAAAAGCG AAAGTTGAGA 
GTAAAAAAGC 

1501 TGAGGCTACA AGGTTAGAAA ACATCAAGAC AGATCGTAAA AAAGCAGAAG AAGAAGCTAA 
1561 ACGAAAAGCA GCAGAAGAAG ATAAAGTTAA AGAAAAACCA GCTGAACAAC CACAACCAGC 
1621 GCCGGCTACT CAACCAGAAA AACCAGCTCC AAAACCAGAG AAGCCAGCTG 
AACAACCAAA 

1681 AGCAGAAAAA ACAGATGATC AACAAGCTGA AGAAGACTAT GCTCGTAGAT CAGAAGAAGA 
1741 ATATAATCGC TTGACTCAAC AGCAACCGCC AAAAACTGAA AAACCAGCAC AACCATCTAC 
1801 TCCAAAAACA GGCTGGAAAC AAGAAAACGG TATGTGGTAC TTCTACAATA CTGATGGTTC 
1861 AATGGCAACA GGATGGCTCC AAAACAACGG TTCATGGTAC TATCTAAACG CTAATGGTGC 
1921 TATGGCGACA GGATGGCTCC AAAACAATGG TTCATGGTAC TATCTAAACG CTAATGGTTC 
1981 AATGGCAACA GGATGGCTCC AAAACAATGG TTCATGGTAC TACCTAAACG CTAATGGTGC 
2041 TATGGCGACA GGATGGCTCC AATACAATGG TTCATGGTAC TACCTAAACA 
GCAATGGCGC 

2101 TATGGCGACA GGATGGCTCC AATACAATGG CTCATGGTAC TACCTCAACG 
CTAATGGTGA 

2161 TATGGCGACA GGATGGCTCC AAAACAACGG TTCATGGTAC TACCTCAACG 
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CTAATGGTGA 

2221 TATGGCGACA GGATGGCTCC AATACAACGG TTCATGGTAT TACCTCAACG CTAATGGTGA 
2281 TATGGCGACA GGTTGGGTGA AAGATGGANA TACCTGGTAC TATCTTAAAG 
CATCAGGTGC 

2341 TATGAAAGCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA ATGGCTCAGG 
2401 TGCCCTTGCA GTCAACACAA CTGTAGATGG CTATGGAGTC AATGCCAATG 
GTGAATGGGT 

2461 AAACTAAACC TAATATAACT AGTTAATACT GACTTCCTGT AAGAACTTTT TAAAGTATTC 
2521 CCTACAAATA CCATATCCTT TCAGTAGATA ATATACCCTT GTAGGAAGTT TAGATTAAAA 
2581 AATAACTCTG TAATCTCTAG CCGGATTTAT AGCGCTAGAG ACTACGGAGT I I I I I IGATG 
2641 AGGAAAGAAT GGCGGCATTC AAGAGACTCT TTAAGAGAGT TACGGGTTTT AAACTATTAA 
2701 GCCTTCTCCA ATTGCAAGAG GGCTTCAATC TCTGCTAGGG TGCTAGCTTG 
CGAAATGGCT 

2761 CCACGGAGTTTNGC 



// 





LOCUS BG9163.DNA 2489 BP SS-DNA SYN 
DEFINITION - 




ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 221.. 2489 

BASE COUNT 1026 A 429 C 541 G 493 T 0 OTHER 
ORIGIN - 

1 GATTGTATAC GACCACTATA GGGCGAATTG GGCCCGACGT CGCATGCTCC CGGCCGCCAT 
61 GGCCGCGGGT ATTCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC 
121 AAATGGAAGC ATAAAGAGAT AAATACAACA TTCGATTTAT ATACAGTTCC TATTGAAGTG 
181 ATATAATAAG GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA 
241 AAGAAAAGTA CATTATTCAA TTCGTAAATT TAGTATTGGA GTAGCTAGTG TAGCTGTTGC 
301 CAGCTTGTTC TTAGGAGGAG TAGTCCATGC AGAAGGGGTT AGAAGTGGGA ATAACCTCAC 
361 GGTTACATCT AGTGGGCAAG ATATATCGAA GAAGTATGCT GATGAAGTCG AGTCGCATCT 
421 AGAAAGTATA TTGAAGGATG TCAAAAAAAA TTTGAAAAAA GTTCAACATA CCCAAAATGT 
481 CGGCTTAATT ACAAAGTTGA GCGAAATTAA AAAGAAGTAT TTGTATGACT TAAAAGTTAA 
541 TGTTTTATCG GAAGCTGAGT TGACGTCAAA AACAAAAGAA ACAAAAGAAA AGTTAACCGC 
601 AACTTTTGAG CAGTTTAAAA AAGATACATT ACCAACAGAA CCAGAAAAAA AGGTAGCAGA 
661 AGCTCAGAAG AAGGTTGAAG AAGCTAAGAA AAAAGCCGAG GATCAAAAAG AAAAAGATCG 
721 CCGTAACTAC CCAACCATTA CTTACAAAAC GCTTGAACTT GAAATTGCTG AGTCCGATGT 
781 GGAAGTTAAA AAAGCGGAGC TTGAACTAGT AAAAGTGAAA GCTAAGGAAT CTCAAGACGA 
841 GGAAAAAATT AAGCAAGCAG AAGCGGAAGT TGAGAGTAAA CAAGCTGAGG CTACAAGGTT 
901 AAAAAAAATC AAGACAGATC GTGAAGAAGC TAAACGAAAA GCAGATGCTA AGTTGAAGGA 
961 AGCTGTTGAA AAGAATGTAG CGACTTCAGA GCAAGATAAA CCAAAGAGGC 
GGGCAAAACG 

1021 AGGAGTTTCT GGAGAGCTAG CAACACCTGA TAAAAAAGAA AATGATGCGA AGTCTTCAGA 
1081 TTCTAGCGTA GGTGAAGAAA CTCTTCCAAG CCCATCCCTT AATATGGCAA ATGAAAGTCA 
1 141 GACAGAACAT AGGAAAGATG TCGATGAATA TATAAAAAAA ATGTTGAGTG AGATCCAATT 
1201 AGATGGAAGA AAACATACCC CAAATGTCAA CTTAAACATA AAGTTGAGCG CAATTAAAAC 
1261 GAAGTATTTG TATGAATTAA GTGTTTTAAA AGAGAACTCG AAAAAAGAAG AGTTGACGTC 
1321 AAAAACCAAA GCAGAGTTAA CCGCAGCTTT TGAGCAGTTT AAAAAAGATA CATTGAAACC 
1381 AGAAAAAAAA GTAGCAGAAG CTGAGAAGAA GGTTGAAGAA GCTAAGAAAA 
AAGCCAAGGA 

1441 TCAAAAAGAA GAAGATCGCC GTAACTACCC AACCAATACT TACAAAACGC TTGAACTTGA 
1501 AATTGCTGAG TCCGATGTGA AAGTTAAAGA AGCGGAGCTT GAACTAGTAA 
AAGAGGAAGC 

1561 TAACGAATCT CGAAACGAGG AAAAAATTAA GCAAGCAAAA GAGAAAGTTG AGAGTAAAAA 
1621 AGCTGAGGCT ACAAGGTTAG AAAAAATCAA GACAGATCGT AAAAAAGCAG AAGAAGAAGC 
1681 TAAACGAAAA GCAGAAGAAT CTGAGAAAAA AGCTGCTGAA GCCAAACAAA AAGTGGATGC 
1741 TGAAGAATAT GCTCTTGAAG CTAAAATCGC TGAGTTGGAA TATGAAGTTC AGAGACTAGA 
1801 AAAAGAGCTC AAAGAGATTG ATGAGTCTGA CTCAGAAGAT TATCTTAAAG AAGGCCTCCG 
1861 TGCTCCTCTT CAATCTAAAT TGGATACCAA AAAAGCTAAA CTATCAAAAC TTGAAGAGTT 
1921 GAGTGATAAG ATTGATGAGT TAGACGCTGA AATTGCAAAA CTTGAAGTTC AACTTAAAGA 
1981 TGCTGAAGGA AACAATAATG TAGAAGCCTA CTTTAAAGAA GGTTTAGAGA AAACTACTGC 
2041 TGAGAAAAAA GCTGAATTAG AAAAAGCTGA AGCTGACCTT AAGAAAGCAG TTGATGAGCC 
2101 AGAAACTCCA GCTCCGGCTC CTCAACCAGC TCCGGCTCCA GAAAAACCAG 
CTGAAAAACC 

2161 AGCTCCAGCT CCAGCTCCAG AAAAACCAGC TCCAGCTCCA GAAAAACCAG 
CTCCAGCTCC 

2221 AGAAAAACCA GCTCCAGCTC CAGAAAAACC AGCTCCAGCT CCAGAAAAAC 
CAGCTCCAGC 

2281 TCCAGAAAAA CCAGCTCCAG CTCCAGAAAA ACCAGCTCCA GCTCCTAAAC CAGAAACTCC 
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2341 AGAAACAGGC TGGAAACAAG AAAACGGTAT GTGGTACTTC TACAATACTG ATGGTTCAAT 
2401 GGCAACAGGC TGGCTCCAAA ACAATGGCTC ATGGTACTAC CTCAACAGCA 
ATGGCGTTAT 

2461 GGCGACAGGA TGGTTCCCAA ACAATGGTC 

// 




LOCUS BG8090.DNA 1680BPSS-DNA SYN 
DEFINiTION - 
ACCESSION - 
KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 220..1680 
BASE COUNT 714 A 287 C 375 G 304 T 0 OTHER 
ORIGIN - 

1 ATTGTATACG ACTCACTATA GGGCGAATTG GGCCCGACGT CGCATGCTCC CGGCCGCCAT 
61 GGCCGCGGGA TTCGACGAAT AGCTGAAGAG GAAAAGCTAT TACATGAAGT TATAATCCCA 
121 AATGGAAGCA TAAAGAGATA AATACAAAAT TCGATTTATA TACAGTTCAT ATTGAAGTGA 
181 TATAGTAAGG TTAAAGAAAA AATATAGAAG GAAATAAACA TGTTTGCATC AAAAAACGAA 
241 AGAAAAGTAC ATTATTCAAT TCGTAAATTT AGTATTGGAG TAGCTAGTGT AGCTGTTGCC 
301 AGTCTTTTTA TGGGAAGTGT GGTTCATGCG ACAGAGAAGG AGGTAACTAC CCAAGTAGCC 
361 ACTTCTTTTA ATAAGGCAAA TAAAAGTCAG ACAGAACATA TGAAAGCTGC TAAACAAGTC 
421 GATGAATATA TAACAAAAAA GCTCCAATTA GATAGAAGAA AACATACCCA AAATGTCGGC 
481 TTACTCACAA AGTTGGGCGT AATTAAAACG GAGTATTTGC ATAGATTAAG TGTTTCAAAA 
541 GAGAAGTCGG AAGCTGAGTT GCCGTCAGAA ATAAAAGCAA AGTTAGACGC AGCTTTTGAG 
601 CAGTTTAAAA AAGATACATT ACCAACAGAA CCAGGAAAAA AGGTAGCAGA AGCTGAGAAG 
661 AAGGTTGAAG AAGCTAAGAA AAAAGCCGAG GATCAAAAAG AAGAAGATCG TCGTAACTAC 
721 CCAACCATTA CTTACAAAAC GCTTGAACTT GAAATTGCTG AGTCCGATGT GGAAGTTAAA 
781 AAAGCGGAGC TTGAACTAGT AAAAGAGGAA GCTAAGGGAT CTCGAAACGA GCAAAAAGTT 
841 AACCAAGCAA AAGCGAAAGT TGAGAGTAAA CAAGCTGAGG CTACAAGGTT AAAAAAAATC 
901 AAGACAGATC GTGAACAAGC TGAGACTACA AGGTTAGAAA ACATCAAGAC AGATCGTGAA 
961 AAAGCAGAAG AAGCTAAACG AAAAGCAGAT GCTAAAGAGC AAGATGAATC 
AAAGAGGCGG 

1021 GTAAAAGGAG GAGTTCCGGG AGAGCAAGCA ACACTTGATA AAAAAGAAAA 
TGATGCGAAG 

1081 TCTTCAGATT CTAGCGTAGG TGAAGAAACT CTTCCAAGCC CATCCCTGAA ATCAGGAAAA 
1 141 AAGGTAGCAG AAGCTGAGAA GAAGGTTGCA GAAGCTGAGA AAAAAGCCAA 
GGATCAAAAA 

1201 GAAGAAGATC GCCGTAACTA CCCAACCAAT ACTTACAAAA CGCTTGAACT TGAAATTGCT 
1261 GAGTCCGATG TGAAAGTTAA AGAAGCGGAG CTTGAACTAG TAAAAGAGGA 
AGCTAAGGAA 

1321 TCTCGAAACG AGGAAAAAGT TAAGCAAGCA AAAGCGGAAG TTGAGAGTAA 
AAAAGCTGAG 

1381 GCTACAAGGT TAGAAAAAAT CAAGACAGAT CGTAAAAAAG CAGAAGAAGC TAAACGAAAA 
1441 GCAGCAGAAG AAGATAAAGT TAAAGAAAAA CCAGCTGAAC AACCACAACC 
AGCGCCGGCT 

1501 CCTCAACCAG AAAAACCAGC TCCAGCTCCA AAACCAGAGA ATCCAGCTGA ACAACCAAAA 
1561 GCAGAAAAAC CAGCTGATCA ACAAGCTGAA GAAGACTATG CTCGTAGATC AGAAGAAGAA 
1621 TATAATCGCT TGACTCAACA GCAACCGCCA AAAACTGAAA AACCAGCACA ACCATCTACT 



LOCUS L81905.DNA 1766BPSS-DNA SYN 
DEFINITION - 
ACCESSION - 




KEYWORDS - 
SOURCE - 

FEATURES Location/Qualifiers 

CDS 217..1766 
BASE COUNT 741 A 313 C 402 G 309 T 1 OTHER 
ORIGIN - 

1 GTATACGACT CACTATAGGG CGAATTGGGC CCGACGTCGC ATGCTCCCGG 
CCGCCATGGC 

61 CGCGGGATTC GACGAATAGC TGAAGAGGAA AAGCTATTAC ATGAAGTTAT AATCCCAAAT 
121 GGAAGCATAA AGAGATAAAT ACAAAATTCG ATTTATATAC AGTTCATATT GAAGTGATAT 
1 81 AGTAAGGTTA AAGAAAAAAT ATAGAAGGAA ATAAACATGT TTGCATCAAA AAGCGAAAGA 
241 AAAGTACATT ATTCAATTCG TAAATTTAGT GTTGGAGTAG CTAGTGTAGT TGTTGCCAGT 
301 CTTGTTATGG GAAGTGTGGT TCATGCGACA GAGAACGAGG GAGCTACCCA 
AGTACCCACT 

361 TCTTCTAATA GGGCAAATGA AAGTCAGGCA GAACAAGGAG AACAACCTAA AAAACTCGAT 
421 TCAGAACGAG ATAAGGCAAG GAAAGAGGTC GAGGAATATG TAAAAAAAAT AGTGGGTGAG 
481 AGCTATGCAA AATCAACTAA AAAGCGACAT ACAATTACTG TAGCTCTAGT TAACGAGTTG 
541 AACAACATTA AGAACGAGTA TTTGAATAAA ATAGTTGAAT CAACCTCAGA AAGCCAACTA 
601 CAGATACTGA TGATGGAGAG TCGATCAAAA GTAGATGAAG CTGTGTCTAA GTTTGAAAAG 
661 GACTCATCTT CTTCGTCAAG TTCAGACTCT TCCACTAAAC CGGAAGCTTC AGATACAGCG 
721 AAGCCAAACA AGCCGACAGA ACCAGGAGAA AAGGTAGCAG AAGCTAAGAA 
GAAGGTTGAA 

781 GAAGCTGAGA AAAAAGCCAA GGATCAAAAA GAAGAAGATC GTCGTAACTA CCCAACCATT 
841 ACTTACAAAA CGCTTGAACT TGAAATTGCT GAGTCCGATG TGGAAGTTAA AAAAGCGGAG 
901 CTTGAACTAG TAAAAGTGAA AGCTAACGAA CCTCGAGACG AGCAAAAAAT TAAGCAAGCA 
961 GAAGCGGAAG TTGAGAGTAA ACAAGCTGAG GCTACAAGGT TAAAAAAAAT CAAGACAGAT 
1021 CGTGAAGAAG CAGAAGAAGA AGCTAAACGA AGAGCAGATG CTAAAGAGCA 
AGGTAAACCA 

1081 AAGGGGCGGG CAAAACGAGG AGTTCCTGGA GAGCTAGCAA CACCTGATAA 
AAAAGAAAAT 

1 141 GATGCGAAGT CTTCAGATTC TAGCGTAGGT GAAGAAACTC TTCCAAGCCC ATCCCTGAAA 
1201 CCAGAAAAAA AGGTAGCAGA AGCTGAGAAG AAGGTTGAAG AAGCTAAGAA 
AAAAGCCGAG 

1261 GATCAAAAAG AAGAAGATCG CCGTAACTAC CCAACCAATA CTTACAAAAC GCTTGAACTT 
1321 GAAATTGCTG AGTCCGATGT GGAAGTTAAA AAAGCGGAGC TTGAACTAGT AAAAGAGGAA 
1381 GCTAAGGAAC CTCGAAACGA GGAAAAAGTT AAGCAAGCAA AAGCGGAAGT 
TGAGAGTAAA 

1441 AAAGCTGAGG CTACTAGGTT AGAAAAAATC AAGACAGATC GTAAAAAAGC AGAAGAAGAA 
1501 GCTAAACGAA AAGCAGCAGA AGAAGATAAA GTTAAAGAAA AACCAGCTGA ACAACCACAA 
1561 CCAGCGCCGG CTCCAAAAGC AGAAAAACCA GCTCCAGCTC CAAAACCAGA 
GAATCCAGCT 

1621 GAACAACCAA AAGCAGAAAA ACCAGCTGAT CAACAAGCTG AAGAAGAGTA TGCTCGTAGA 
1681 TCAGAAGAAG AATATAATCG CTTGACTCTA CAGCAACCGC CAAAAACTGA AAAACCAGCA 
1741 CAACCATCTA CTCCAAAAAC AAANAC 
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DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE - 
FEATURES 



LOCUS 



DBL6A.DNA 1 590 BP SS-DNA 



SYN 



CDS 



Location/Qualifiers 
127.. 1590 



BASE COUNT 701 A 261 C 339 G 288 T 1 OTHER 
ORIGIN - 

1 AAACTATTAC ATGAAGTTAT AATCCCAAAT GGAAGCATAA AGAGATAAAT ACAAAATTCG 
61 ATTTATATAC AGTTCATATT GAAGTGATAT AGTAAGGTTA AAGAAAAAAT ATAGAAGGAA 
121 ATAATTATGT TTGCATCYAA AAGCGAAAGA AAAGTACATT ATTCAATTCG TAAATTTAGT 
181 ATTGGAGTAG CTAGTGTAGC TGTTGCTAGC TTGTTCTTAG GAGGAGTAGT CCATGCAGAA 
241 GGGGTTAGAA GTGAGAATAC CCCCAAGGTT ACATCTAGTG GGGATGAAGT CGATGAATAT 
301 ATAAAAAAAA TGTTGAGTGA GATCCAATTA GATAAAAGAA AACATACCCA CAATTTCGCC 
361 TTAAACCTAA AGTTGAGCAG AATTAAAACG GAGTATTTGT ATAAATTAAA AGTTAATGTT 
421 TTAGAAGAAA AGTCAAAAGC TGAGTTGACG TCAAAAACAA AAAAAGAGGT AGACGCAGCT 
481 TTTGAGAAGT TTAAAAAAGA TACATTGAAA CTAGGAGAAA AGGTAGCAGA AGCTCAGAAG 
541 AAGGTTGAAG AAGCTAAGAA AAAAGCCAAG GATCAAAAAG AAGAAGATCA CCGTAACTAC 
601 CCAACCAATA CTTACAAAAC GCTTGAACTT GAAATTGCTG AGTCCGATGT GAAAGTTAAA 
661 GAAGCGGAGC TTGAACTATT GAAAGAGGAA GCTAAAACTC GAAACGAGGA CACAATTAAC 
721 CAAGCAAAAG CGAAAGTTAA GAGTGAACAA GCTGAGGCTA CAAGGTTAAA AAAAATCAAG 
781 ACAGATCGTG AACAAGCTGA GGCTACAAGG TTAGAAAACA TCAAGACAGA TCGTGAAAAA 
841 GCAGAAGAAG CTAAACGAAA AGCAGAAGCA GAAGAAGTTA AAGATAAACT AAAGAGGCGG 
901 ACAAAACGAG CAGTTCCTGG AGAGCCAGCA ACACCTGATA AAAAAGAAAA TGATGCGAAG 
961 TCTTCAGATT CTAGCGTAGG TGAAGAAACT CTTCCAAGCC CATCCCTGAA ATCAGGAAAA 
1021 AAGGTAGCAG AAGCTCAGAA GAAGGTAGCA GAAGCTGAGA AAAAAGCCAA 
GGATCAAAAA 

1081 GAAGAAGATC GCCGTAACTA CCCAACCAAT ACTTACAAAA CGCTTGACCT TGAAATTGCT 
1141 GAGTCCGATG TGAAAGTTAA AGAAGGGGAG CTTGAACTAG TAAAAGAGGA 
AGCTAAGGAA 

1201 TCTCGAAACG AGGAAAAAGT TAAGCAAGCA AAAGCGAAAG TTGAGAGTAA AAAAGCTGAG 
1261 GCTACAAGGT TAGAAAAAAT CAAGACAGAT CGTAAAAAAG CAGAAGAAGC TAAACGAAGA 
1321 GCAGCAGAAG AAGATAAAGT TAAAGAAAAA CCAGCTGAAC AACCACAACC 
AGCGCCGGCT 

1381 CCTCAACCAG AAAAACCAAC TGAAGAGCCT GAGAATCCAG CTCCAGCTCC 
AAAAGCTGAG 

1441 AATCCAGGTG AACAACCAAA AGGAGAAAAA CCAGCTGATC AACAAGCTGA AGAAGACTAT 
1 501 GCTGGTAGAT CAGAAGAAGA ATATAATGGG TTGACTCAAC AGCAACCGCC AAAAACTGAA 
1561 AAACCAGCAC AACCATCTAC TCCAAAAACA 
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